CDS

Accession Number TCMCG075C23072
gbkey CDS
Protein Id XP_007017556.2
Location 1214194..1216161
Gene LOC18591394
GeneID 18591394
Organism Theobroma cacao

Protein

Length 655aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007017494.2
Definition PREDICTED: long-chain-fatty-acid--AMP ligase FadD26 [Theobroma cacao]
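
The Location and Length fields above can be cross-checked against each other: a CDS encoding a 655 aa protein needs 655 codons plus one stop codon. The following is a minimal Python sketch of that check, not part of the record itself. It assumes the coordinates are 1-based and inclusive, and that the CDS is the single uninterrupted interval listed in Location. The function names `cds_span_length` and `expected_cds_length` are hypothetical helpers introduced for illustration.

```python
def cds_span_length(location: str) -> int:
    """Length in nucleotides of a simple 'start..end' interval.

    Assumes 1-based, inclusive coordinates and no joins or complements.
    """
    start, end = (int(part) for part in location.split(".."))
    return end - start + 1


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# Values copied from the Location and Length fields of this record.
span = cds_span_length("1214194..1216161")
expected = expected_cds_length(655)
print(span, expected, span == expected)
```

Under these assumptions both values are 1968, so the listed interval and the protein length agree.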

EGGNOG-MAPPER Annotation

COG_category IQ
Description Disco-interacting protein 2 homolog
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0006629
GO:0008150
GO:0008152
GO:0008610
GO:0009058
GO:0009273
GO:0009987
GO:0042546
GO:0044085
GO:0044238
GO:0071554
GO:0071704
GO:0071766
GO:0071840
GO:1901576

Sequence

CDS:  
ATGAATTATGAAAACTATGATCCTTCTTTCCCTGACCAACCAGTTGTTGATCAATACCTTCCCATATGGGCTAGCCTACCAGCCTTCAGGTCCAAGCCAGCCTTCATTTGGCCCGACGATGGCTCGACCGATGTAAGCAAGAGCTCAACACTTACCTATGCTCAGCTTAATGATTCAGTGCAGTCCATTTCCTTTCAGCTACTCCTTTCATTGCAAAGAGGTGACACAATAGTCATTTTATGCTCACCTGGGCTAGAGCTCGTTGAGATCATTTTTGGGTGTCAACGAGCTGGCCTTTTAAGCGTACCCATAATCCCACCAGACCCTTCTTTCGCCAAAGAAAACTATCACCACCTAGTCAGAGTTCTCTCCCAGACAAAGCTCAAAGCTGCCATAGCTCACCATGATTACATCACAAGGGTTCAGCAATATCTCTCTTCGCCCTCTAAAGACGAGAGGCTTGCAGGGATGTTACAAAACCTGATATGGATTTCGACTGGAGATATCAAACATAAAAATGTAGATTCAACAGCGGGTTCTATGTTTTATAATGGTTGCAAACCTGACGAACTTTACTTGATTCAATACACCTCAGGTGCAACAGGGATCCCAAAGCCCGTGCTTGTGACAGCTGGATCAGCAGCTCACAATGTTAGGACAGCAAGGAAAGCTTATGATTTGCATCCAAATAGTGTTATTGTCTCTTGGTTACCTCAATACCACGATTGTGGCCTAATGTTTCTGTTATTGACCATTGTATCTGGTGCAACATGCGTCTTAACATCACCTGGAGCTTTCGTTAACCGGCCTAGGCTATGGATTGAGCTAATTACAGAGTTCAAGGCTACTTGTACTCCAGTTCCATCCTTCACCCTTCCACTAGTCGTGAAGCGTGGTGGAGTTGAAAAAGGAAGCTCGCCTATTAATCTATGGAGCTTGAGAAATCTCATAATCATCAATGAGCCCATTTACAAGGCATCAGTTGAAGAATTTCTTGATGTGTTTAAGCCATTTGGACTAAACCCATCGTCAATCTCTCCATCTTATGGCTTAGCAGAGAATTGCACATTTGTTTCCACAGCGTGGAGAAACAATGACAACTCTGGAAACTCCAGTTTCCCTCATCTTCCTTCTCACAACAAGCTGCTACCAAGTGCAAGACTTGCTAATGAAGAAGAAGAAGAGGACATGAACATTATTGTTGTAAATGAGGACACCCATGAGCCTGTTGAGGACGAAATTGAGGGTGAGATTTGGGTTTCATCTCCAAGCAATGCTTCTGGTTACCTAGGCCATCCTTTCTTAACTCAAGATATATTTAAAGGTAGACTGAGCAACAAGGCTGGCCGGTGTTTTGTTCGAACAGGAGACAGAGGGATTGTGAAAGGGGCAGAAAGATTTCTCTTTGTGACAGGTCGTTGCCTAGACGTCGTTAAGCTCCCAAACGGTCAGGATATGCACCCTCATTACATAGAGACCACTGCTTATAATACTTGCCCACAGCTTATTAGAGGAGGTTGTCTTGCTGCATTTGATATCTCGAGAATGATCGTTCTTGTTGCAGAGATGCAGAGGAGTGAAAAAGATAACAAGATTTTGAGGGACATATGTGAAAAAATGAGAGAAACGGTTTTGAATCAAGAGAAAGTCGAATTAGGGATGGTAGTTCTTGTAAAAAGTGGAAGTGTTCCGAAAACTACTTCAGGTAAAATTCAAAGATGGGCGGCCAAGGATAACTTTCTAGGAGGTAAAATGAAAGTTTTAATGGAAATGAAGTTTGATAATTACCATGGGGTTTTATTACCATCCCCTGGGGCAATGATACTAGCAAGTAAGGGAAGAGGACAAAGGATAGGAAAAGGAAGAGAAGGTGAAGAGGGAAGAGCCCTGATAGCTGAAGAGAAAGAAGAGATTCCTTTTTCACTGTCAAGTGCTCCAACTCGTCATCCCTGGTTGTCTCGATTGTGA
Protein:  
MNYENYDPSFPDQPVVDQYLPIWASLPAFRSKPAFIWPDDGSTDVSKSSTLTYAQLNDSVQSISFQLLLSLQRGDTIVILCSPGLELVEIIFGCQRAGLLSVPIIPPDPSFAKENYHHLVRVLSQTKLKAAIAHHDYITRVQQYLSSPSKDERLAGMLQNLIWISTGDIKHKNVDSTAGSMFYNGCKPDELYLIQYTSGATGIPKPVLVTAGSAAHNVRTARKAYDLHPNSVIVSWLPQYHDCGLMFLLLTIVSGATCVLTSPGAFVNRPRLWIELITEFKATCTPVPSFTLPLVVKRGGVEKGSSPINLWSLRNLIIINEPIYKASVEEFLDVFKPFGLNPSSISPSYGLAENCTFVSTAWRNNDNSGNSSFPHLPSHNKLLPSARLANEEEEEDMNIIVVNEDTHEPVEDEIEGEIWVSSPSNASGYLGHPFLTQDIFKGRLSNKAGRCFVRTGDRGIVKGAERFLFVTGRCLDVVKLPNGQDMHPHYIETTAYNTCPQLIRGGCLAAFDISRMIVLVAEMQRSEKDNKILRDICEKMRETVLNQEKVELGMVVLVKSGSVPKTTSGKIQRWAAKDNFLGGKMKVLMEMKFDNYHGVLLPSPGAMILASKGRGQRIGKGREGEEGRALIAEEKEEIPFSLSSAPTRHPWLSRL
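
The relationship between the two sequences above can be illustrated with a short translation routine. This is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) and an input containing only unambiguous bases (A, C, G, T); the record does not state which tool produced its translation. The names `CODON_TABLE` and `translate` are hypothetical and introduced only for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGAATTATGAAAACTATGATCCT"))
print(translate("TCTCGATTGTGA"))
```

The two fragments translate to MNYENYDP and SRL, matching the start and end of the listed protein. Passing the full CDS string to `translate` would allow the complete protein sequence to be compared in the same way.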